144 research outputs found

    Identification and characterization of thousands of bacteriophage satellites across bacteria

    Get PDF
    Bacteriophage-bacteria interactions are affected by phage satellites, elements that exploit phages for transfer between bacteria. Satellites can encode defense systems, antibiotic resistance genes, and virulence factors, but their number and diversity are unknown. We developed SatelliteFinder to identify satellites in bacterial genomes, detecting the four best described families: P4-like, phage inducible chromosomal islands (PICI), capsid-forming PICI, and PICI-like elements (PLE). We vastly expanded the number of described elements to ∼5000, finding bacterial genomes with up to three different families of satellites. Most satellites were found in Proteobacteria and Firmicutes, but some are in novel taxa such as Actinobacteria. We characterized the gene repertoires of satellites, which are variable in size and composition, and their genomic organization, which is very conserved. Phylogenies of core genes in PICI and cfPICI indicate independent evolution of their hijacking modules. There are few other homologous core genes between other families of satellites, and even fewer homologous to phages. Hence, phage satellites are ancient, diverse, and probably evolved multiple times independently. Given the many bacteria infected by phages that still lack known satellites, and the recent proposals for novel families, we speculate that we are at the beginning of the discovery of massive numbers and types of satellites

    The major subunit of widespread competence pili exhibits a novel and conserved type IV pilin fold

    Get PDF
    Type IV filaments (T4F), which are helical assemblies of type IV pilins, constitute a superfamily of filamentous nanomachines virtually ubiquitous in prokaryotes that mediate a wide variety of functions. The competence (Com) pilus is a widespread T4F, mediating DNA uptake (the first step in natural transformation) in bacteria with one membrane (monoderms), an important mechanism of horizontal gene transfer. Here, we report the results of genomic, phylogenetic, and structural analyses of ComGC, the major pilin subunit of Com pili. By performing a global comparative analysis, we show that Com pili genes are virtually ubiquitous in Bacilli, a major monoderm class of Firmicutes. This also revealed that ComGC displays extensive sequence conservation, defining a monophyletic group among type IV pilins. We further report ComGC solution structures from two naturally competent human pathogens, Streptococcus sanguinis (ComGCSS) and Streptococcus pneumoniae (ComGCSP), revealing that this pilin displays extensive structural conservation. Strikingly, ComGCSS and ComGCSP exhibit a novel type IV pilin fold that is purely helical. Results from homology modelling analyses suggest that ComGC unusual structure is compatible with helical filament assembly. Because ComGC displays such a widespread distribution, these results have implications for hundreds of monoderm species

    A widespread family of phage-inducible chromosomal islands only steals bacteriophage tails to spread in nature

    Get PDF
    Phage satellites are genetic elements that couple their life cycle to that of helper phages they parasitize, interfering with phage packaging through the production of small capsids, where only satellites are packaged. So far, in all analyzed systems, the satellite-sized capsids are composed of phage proteins. Here, we report that a family of phage-inducible chromosomal islands (PICIs), a type of satellites, encodes all the proteins required for both the production of small-sized capsids and the exclusive packaging of the PICIs into these capsids. Therefore, this new family, named capsid-forming PICIs (cf-PICIs), only requires phage tails to generate PICI particles. Remarkably, the representative cf-PICIs are produced with no cost from their helper phages, suggesting that the relationship between these elements is not parasitic. Finally, our phylogenomic studies indicate that cf-PICIs are present both in gram-positive and gram-negative bacteria and have evolved at least three times independently to spread in nature

    Atypical AT Skew in Firmicute Genomes Results from Selection and Not from Mutation

    Get PDF
    The second parity rule states that, if there is no bias in mutation or selection, then within each strand of DNA complementary bases are present at approximately equal frequencies. In bacteria, however, there is commonly an excess of G (over C) and, to a lesser extent, T (over A) in the replicatory leading strand. The low G+C Firmicutes, such as Staphylococcus aureus, are unusual in displaying an excess of A over T on the leading strand. As mutation has been established as a major force in the generation of such skews across various bacterial taxa, this anomaly has been assumed to reflect unusual mutation biases in Firmicute genomes. Here we show that this is not the case and that mutation bias does not explain the atypical AT skew seen in S. aureus. First, recently arisen intergenic SNPs predict the classical replication-derived equilibrium enrichment of T relative to A, contrary to what is observed. Second, sites predicted to be under weak purifying selection display only weak AT skew. Third, AT skew is primarily associated with largely non-synonymous first and second codon sites and is seen with respect to their sense direction, not which replicating strand they lie on. The atypical AT skew we show to be a consequence of the strong bias for genes to be co-oriented with the replicating fork, coupled with the selective avoidance of both stop codons and costly amino acids, which tend to have T-rich codons. That intergenic sequence has more A than T, while at mutational equilibrium a preponderance of T is expected, points to a possible further unresolved selective source of skew

    Bacteriophages benefit from generalized transduction

    Get PDF
    Temperate phages are bacterial viruses that as part of their life cycle reside in the bacterial genome as prophages. They are found in many species including most clinical strains of the human pathogens, Staphylococcus aureus and Salmonella enterica serovar Typhimurium. Previously, temperate phages were considered as only bacterial predators, but mounting evidence point to both antagonistic and mutualistic interactions with for example some temperate phages contributing to virulence by encoding virulence factors. Here we show that generalized transduction, one type of bacterial DNA transfer by phages, can create conditions where not only the recipient host but also the transducing phage benefit. With antibiotic resistance as a model trait we used individual-based models and experimental approaches to show that antibiotic susceptible cells become resistant to both antibiotics and phage by i) integrating the generalized transducing temperate phages and ii) acquiring transducing phage particles carrying antibiotic resistance genes obtained from resistant cells in the environment. This is not observed for non-generalized transducing temperate phages, which are unable to package bacterial DNA, nor for generalized transducing virulent phages that do not form lysogens. Once established, the lysogenic host and the prophage benefit from the existence of transducing particles that can shuffle bacterial genes between lysogens and for example disseminate resistance to antibiotics, a trait not encoded by the phage. This facilitates bacterial survival and leads to phage population growth. We propose that generalized transduction can function as a mutualistic trait where temperate phages cooperate with their hosts to survive in rapidly-changing environments. This implies that generalized transduction is not just an error in DNA packaging but is selected for by phages to ensure their survival

    Conditions for the Evolution of Gene Clusters in Bacterial Genomes

    Get PDF
    Genes encoding proteins in a common pathway are often found near each other along bacterial chromosomes. Several explanations have been proposed to account for the evolution of these structures. For instance, natural selection may directly favour gene clusters through a variety of mechanisms, such as increased efficiency of coregulation. An alternative and controversial hypothesis is the selfish operon model, which asserts that clustered arrangements of genes are more easily transferred to other species, thus improving the prospects for survival of the cluster. According to another hypothesis (the persistence model), genes that are in close proximity are less likely to be disrupted by deletions. Here we develop computational models to study the conditions under which gene clusters can evolve and persist. First, we examine the selfish operon model by re-implementing the simulation and running it under a wide range of conditions. Second, we introduce and study a Moran process in which there is natural selection for gene clustering and rearrangement occurs by genome inversion events. Finally, we develop and study a model that includes selection and inversion, which tracks the occurrence and fixation of rearrangements. Surprisingly, gene clusters fail to evolve under a wide range of conditions. Factors that promote the evolution of gene clusters include a low number of genes in the pathway, a high population size, and in the case of the selfish operon model, a high horizontal transfer rate. The computational analysis here has shown that the evolution of gene clusters can occur under both direct and indirect selection as long as certain conditions hold. Under these conditions the selfish operon model is still viable as an explanation for the evolution of gene clusters

    A synthesis of bacterial and archaeal phenotypic trait data.

    Full text link
    A synthesis of phenotypic and quantitative genomic traits is provided for bacteria and archaea, in the form of a scripted, reproducible workflow that standardizes and merges 26 sources. The resulting unified dataset covers 14 phenotypic traits, 5 quantitative genomic traits, and 4 environmental characteristics for approximately 170,000 strain-level and 15,000 species-aggregated records. It spans all habitats including soils, marine and fresh waters and sediments, host-associated and thermal. Trait data can find use in clarifying major dimensions of ecological strategy variation across species. They can also be used in conjunction with species and abundance sampling to characterize trait mixtures in communities and responses of traits along environmental gradients

    CRISPR Recognition Tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Clustered Regularly Interspaced Palindromic Repeats (CRISPRs) are a novel type of direct repeat found in a wide range of bacteria and archaea. CRISPRs are beginning to attract attention because of their proposed mechanism; that is, defending their hosts against invading extrachromosomal elements such as viruses. Existing repeat detection tools do a poor job of identifying CRISPRs due to the presence of unique spacer sequences separating the repeats. In this study, a new tool, CRT, is introduced that rapidly and accurately identifies CRISPRs in large DNA strings, such as genomes and metagenomes.</p> <p>Results</p> <p>CRT was compared to CRISPR detection tools, Patscan and Pilercr. In terms of correctness, CRT was shown to be very reliable, demonstrating significant improvements over Patscan for measures precision, recall and quality. When compared to Pilercr, CRT showed improved performance for recall and quality. In terms of speed, CRT proved to be a huge improvement over Patscan. Both CRT and Pilercr were comparable in speed, however CRT was faster for genomes containing large numbers of repeats.</p> <p>Conclusion</p> <p>In this paper a new tool was introduced for the automatic detection of CRISPR elements. This tool, CRT, showed some important improvements over current techniques for CRISPR identification. CRT's approach to detecting repetitive sequences is straightforward. It uses a simple sequential scan of a DNA sequence and detects repeats directly without any major conversion or preprocessing of the input. This leads to a program that is easy to describe and understand; yet it is very accurate, fast and memory efficient, being O(<it>n</it>) in space and O(<it>nm</it>/<it>l</it>) in time.</p
    • …
    corecore